State tying for context dependent phoneme models
Authors
Abstract
In this paper, several modifications of two methods for parameter reduction of Hidden Markov Models by state tying are described. The two methods are a data-driven clustering of triphone states with a bottom-up algorithm [3, 9] and a top-down method that grows decision trees for triphone states [2, 10]. We investigate several aspects of state tying, such as the possible reduction of the word error rate by state tying, the consequences of different distance measures for the data-driven approach, and modifications of the original decision tree approach such as node merging. The tests were performed on the test corpora for the 5 000 word vocabulary of the WSJ November 92 task and on the evaluation corpora for the 3 000 word VERBMOBIL '95 task. State tying reduced the word error rate by 14% for the WSJ task and by 5% for the VERBMOBIL task.
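The bottom-up, data-driven variant mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes each triphone state is summarized by a single one-dimensional Gaussian (count, mean, variance), and it uses the loss in log-likelihood caused by a merge as the distance measure, which is only one of several possible choices. All names (`StateStats`, `merge_cost`, `tie_states`, `max_cost`) and the example statistics are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class StateStats:
    """Sufficient statistics of one cluster of tied triphone states (assumed 1-D Gaussian)."""
    names: tuple   # triphone states tied together in this cluster
    count: float   # number of training observations
    mean: float
    var: float


def merge(a, b):
    """Pool the statistics of two clusters into one Gaussian."""
    n = a.count + b.count
    mean = (a.count * a.mean + b.count * b.mean) / n
    # Pooled second moment minus the squared pooled mean gives the pooled variance.
    second = (a.count * (a.var + a.mean ** 2) + b.count * (b.var + b.mean ** 2)) / n
    return StateStats(a.names + b.names, n, mean, second - mean ** 2)


def log_likelihood(s):
    """Log-likelihood of a cluster's data under its own maximum-likelihood Gaussian."""
    return -0.5 * s.count * (math.log(2.0 * math.pi * s.var) + 1.0)


def merge_cost(a, b):
    """Distance measure (an assumption): log-likelihood lost by tying a and b."""
    return log_likelihood(a) + log_likelihood(b) - log_likelihood(merge(a, b))


def tie_states(states, max_cost):
    """Greedily merge the closest pair of clusters until the cheapest merge exceeds max_cost."""
    clusters = list(states)
    while len(clusters) > 1:
        cost, i, j = min(
            (merge_cost(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        if cost > max_cost:
            break
        merged = merge(clusters[i], clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters


# Made-up statistics for the middle state of three triphones of the phoneme "b".
example_states = [
    StateStats(("a-b+c[2]",), 100.0, 0.0, 1.0),
    StateStats(("d-b+c[2]",), 100.0, 0.1, 1.0),
    StateStats(("a-b+n[2]",), 100.0, 5.0, 1.0),
]
tied = tie_states(example_states, max_cost=10.0)
for cluster in tied:
    print(cluster.names, round(cluster.mean, 3))
```

In this toy setting the two states with nearly identical means are tied, while the third stays separate because merging it would cost far more log-likelihood than the threshold allows. A real system would use multivariate Gaussians or mixtures and typically also a minimum number of observations per cluster.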
Related resources
Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
Allophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
Context dependent phoneme duration modeling with tree-based state tying
In this paper, we propose phoneme duration modeling methods with tree-based state tying. Two kinds of phone duration modeling methods are suggested. The first is a context-independent phoneme duration model in which duration parameters are stored in each phone. The second is a context-dependent duration model in which duration parameters are stored in each state being shared by context dependent ph...
Asynchronous-transition HMM for Acoustic Modeling
We propose a new class of hidden Markov model (HMM) which we call Asynchronous-Transition HMM (AT-HMM) to model asynchronous temporal structure of acoustic feature sequences. Conventional HMM models a sequence of feature vectors, while temporally changing patterns of acoustic features do not necessarily synchronize with each other. In this paper, AT-HMMs with and without sequential constraints ...
Automatic question generation for decision tree based state tying
Decision tree based state tying uses so-called phonetic questions to assign triphone states to reasonable acoustic models. These phonetic questions are in fact phonetic categories such as vowels, plosives or fricatives. The assumption behind this is that context phonemes which belong to the same phonetic class have a similar influence on the pronunciation of a phoneme. For a new phoneme set, wh...
Journal title:
Volume, issue:
Pages: -
Publication date: 1997